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Discrete differential geometry of triangle 
tiles and algebra of closed trajectories 



By NAOTO Morikawa 



1. Introduction 

This paper proposes a new mathematical framework that can be applied 
to biological problems such as analysis of the structures of proteins and protein 
complexes. 

In particular, it gives a new method for encoding the three-dimensional 
structure of a protein into a binary sequence, where proteins are approximated 
by a folded tetrahedron sequence. The feature of the method is the correspon- 
dence between protein folding and "integration". And the binary code of a 
protein is obtained as the "second derivative" of the native folded structure of 
the protein. With this method at hand, it becomes possible to describe the 
structure of a protein without any subjective hierarchical classification. 

It also gives a new algebraic framework for describing molecular complexes 
and their interactions. 

First we describe the biological background briefly. 



1.1. The three-dimensional structures of proteins 

Protein is a sequence of amino-acids linked by peptide bonds, where the order 
of amino-acids are encoded by gene. In nature proteins are folded into a well- 
defined three-dimensional structure (native state) and this process is called 
protein folding. Protein folding is reproducible and a protein always folds into 
the particular naive state rapidly. The native state is characterized by its free 
energy which is significantly lower than that of alternative structures. (See [2] 
for more information.) 

The functional properties of proteins depend upon their three-dimensional 
structures. For example, the three-dimensional structure of a protein deter- 
mines the active site of an enzyme, binding site of a drug, or binding site for 
another protein. These active and binding sites are key to understanding how 
proteins interact with other proteins in the cell and how particular molecular 
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Figure 1: Basic ideas, (a): Coordinate axes, where v\ = (0,0,0), v x = (1,0,0), 
v y = (0,1,0), and v z = (0,0,1). (b): Unit cube in M 3 and the projection of 
the upper faces, (c): Slant-tiles over a fiat-tile, where s = v a [xiX2], cr(s) = 



J axy 



[x2,x\\, and so on. 



targets interact with drugs. That is, the knowledge of the three-dimensional 
structures is crucial to the study of protein function. 

Currently, with the growing database of known protein structures, the 
classification of the structures plays a central role in understanding the princi- 
ples of protein structure and function. So far, the structures are classified in a 
structural hierarchy based on similarity measures. But classification schemes 
are to some extent subjective and are not amenable to automation. As a result, 
there are number of different classifications. 



1.2. The structures of protein complexes 

Biological processes, such as signal transmission, cell-fate regulation, transcrip- 
tion, and so on, are not performed by freely diffusing and occasionally collid- 
ing proteins. Instead, proteins usually do their jobs by forming structured 
ensemble of proteins, that is, protein complexes. And one should analyze the 
structures of protein complexes for a description of biochemical and cellular 
functions. (See [1] and [7] for more information.) 

Frequently, these complexes comprise ten or more subunits. For example, 
70S ribosome consists of 52 proteins and three RNA molecules. But there is no 
way but rendering a protein complex on a computer graphic system to describe 
the topological arrangement of its subunits. 



1.3. Previous works 



Because of the rigidity of covalent bonds between successive amino-acids, the 
only degree of freedom proteins have are rotations around these covalent bonds. 
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And, traditionally, protein backbone structures are represented by the rotation 
angles. (See [2] for more information.) 

[6] proposed a representation of the protein backbone structures based on 
concepts of differential geometry. In their method, a protein is represented as 
broken lines, where each line corresponds to the virtual-bond between the Ca 
atoms of successive residues. And they defined the curvature and torsion at 
each point to describe the native folded structure of a protein. 

As for absolute description of protein structure, [8] proposed a set of ideal- 
ized structures to allow the rigorous and automatic definition of protein topol- 
ogy. And [5] constructed a measure of similarity to classify protein structures 
automatically. On the other hand, [3] used the metric matrix distance geom- 
etry method to present a three-dimensional map of the protein structures in 
which structurally related proteins are represented by spatially adjacent points. 

2. Basic ideas 

The basic idea behind this framework is the following observation: let's 
consider unit cubes in the iV-dimensional Euclidean space ~K N and pile them up 
in the direction from (+oo, +oo, • • • , +oo) to (— oo, — oo, • • • , — oo) and view the 
resulting surface from (— oo, — oo, • • • , — oo) (Fig.l(a)). If one prints a pattern 
on the upper faces of each cubes, he/she obtains a drawing made up of the 
patterns, which defines a flow of N — 1-dimensional polyhedron tiles of N 
faces. We call the N — 1-dimensional polyhedron tiles triangle tiles if N = 3 
and tetrahedron tiles if N = 4. 

In the case of N = 4, we obtain a flow of tetrahedrons which we use 
for encoding of space curves and the structure of proteins. For simplicity, we 
shall explain the framework in the case of N = 3, where we use triangle tile 
sequences for encoding of plane curves and others. 

Consider a unit cube in the three-dimensional Euclidean space M 3 specified 
by the following vertices (Fig. 1(b)): 

v a = (I, m, n) G Z 
Vax = (I + l,m,n) G I? 
v ay = (l,m + l,n) G 1? 
v az = (l,m,n + 1) G I? 
And draw lines v a v aX y, v a v ay z, and v a v aX z on three faces v a v a xVaxyVay, 

VaVaxVaxy 

v ay , and v a v ax v a xyVa of the cube respectively, where 

Vaxy = (I + l,m,n) G 1? 
Vayz = (I, m + 1, n + 1) G Z 3 
Vaxz = (Z + 1, m, 71 + 1) € Z 3 
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Figure 2: Peaks and valleys, (a): Drawing denned by a peak v\ = (0,0,0). 
(b): Drawing denned by three peaks v xy = (1, 1,0), v yz = (0, 1, 1), and v xz = 
(1,0,1). 

Then each face is divided into two slant triangle tiles. For example, triangles 

VaVaxVaxy &nd V a V ay V axy for V a V ax V axy V ay . 

By piling up these cubes in the direction from (+oo, +oo, +oo) to (— oo, 
— oo, — oo), we obtain "peaks and valleys" of cubes with a "drawing of broken 
lines" on them. Note that the drawing is uniquely determined by the peaks 
and divides the surface into sequences of slant triangle tiles. And we obtain a 
division of M 2 into sequences of flat triangle tiles by "viewing the surface from 
(— oo, — oo, — oo)". 

For example, the peaks and valleys defined by a peak, say v± = (0,0,0), 
form an infinite triangular pyramid, where the top vertex is v\ and the three 
edges are defined by the positive part of x-, y-, and z-axes (Fig.2(a)). In this 
case, each of the three slopes of the cone is divided by a collection of parallel 
straight lines into infinite sequences of slant triangle tiles. 

As another example, let's consider the peaks and valleys defined by three 
peaks, say v xy = (1,1,0), v yz = (0,1,1), and v xz = (1,0,1) (Fig.2(b)). Then, 
the drawing defines a closed sequence of six slant triangle tiles v xy v xyz v X 2yz, 
v xz v xyz v X 2 yz , v xz v xyz v xyz 2, v yz v xyz v xyz 2, v yz v xyz v xy 2zt <md v xy v xyz v xy 2z , where 
v X yz = (1,1,1), v x2yz = (2,1,1), v xy2z = (1,2,1), and v xyz2 = (1,1,2). (See 
also Fig.4(b).) 

In the following, we define a vector field on a collection of flat tiles, where 
a sequence of flat tiles induced by the "peaks and valleys" corresponds to a 
trajectory defined by the vector field. 

3. Cones of three-dimensional lattices 

Here we give the precise definition of the "peaks and valleys" of cubes and 
the drawing on them. 

In the first place, we define two kinds of three-dimensional lattices, the 
standard lattice and its conjugate. 
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Definition 3.1 (L 3 and its conjugate L 3 *). L 3 := 1? and L 3 * := 1? , 
where L 3 is embedded in L 3 * and L 3 * is embedded in M 3 by the following 
mappings: 



Note that a point of L3* corresponds to a "fractional point" of L 3 : 
(mi, 1712, m 3 ) G L 3 * i-> ((m 2 + m 3 - mi)/2, (mi + m 3 - m 2 )/2, 

(mi + m 2 - m 3 )/2) G L 3 

if mjS are multiples of two. 

Using three indeterminates xi, x 2 , £3, we obtain the following represen- 
tation of the lattices (Fig.3(a)). 

Lemma 3.2 (Monomial representation of L 3 and L 3 *). 



where y\ = x 2 x 3 , y 2 = x\x 3 , and y 3 = xix 2 . 

Now let's consider two types of infinite triangular pyramids for each lattice. 

Definition 3.3 (Standard cones and roofs). For A C L 3 * , 

Cone A := {(Z 2 + Z 3 , h + k,h + h) G K 3 | {h,l 2 ,h) G R 3 
and 3(ai, a 2 , a 3 ) G A s.t. < k {i = 1, 2, 3)} , 

Roof A:= {{l 2 + hA + hM + h)^R? I (ii,Z 2 ,fe) 

and 3m G N s.t. (Z 2 + Z 3 , Zi + m + Z3, Zi + m + Z 2 ), 
(Z 2 + m + Z 3 , Zi + l 3 , h + h + m), (Z 2 + l 3 + m, Zi + Z 3 + m, Z x + Z 2 ) 
G Cone A} . 

./Vote t/iat Cone A C Roof A for any A C L 3 * . Cone A is called standard cone 
and Roof A is called standard roof. 

Definition 3.4 (Conjugate cones and roofs). For A c L 3 * , we define 
Cone* A := {(h,l 2 ,h) G IR 3 I 3(ai,a 2 ,a 3 ) G A s.t. a* < k (i = 1,2,3)} , 
itoo/M := {(Zi,Z 2 ,Z 3 ) GK 3 I 3m GN s.t. (Zi + m, l 2 , Z 3 ), (Zi, Z 2 + m, Z 3 ), 



(Zi, Z 2 , Z 3 ) G L 3 i-> (Z 2 + Z 3 , Zi + Z 3 , Zi + Z 2 ) G L 3 *, 
(mi,m 2 , m 3 ) G L 3 * h-> (mi, m 2 , m 3 ) G K 3 . 




V - {xi ml x 2 m2 x 3 m3 I (mi,m 2 ,m 3 ) G Z 3 } 



6?/ one-to-one correspondences 




(Zi, Z 2 , Z 3 + m) G Cone* ,4} . 
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Figure 3: Lattices and cones. (a): Two lattices L3 (white) and L3* 
(gray), (b): Cone*{v x ,v y ,v z }. (c): Roof*{v x ,v y ,v z } = Cone*{vi}. (d): 
Roof {v x ,v y ,v z } = Cone {v x , v y , v z }. (e): Cone {vi}. 

Note that Cone* A C Roof* A for any A C L3*. Cone* A is called conjugate 
cone and Roof* A is called conjugate roof. 

For example, Roof {v x ,v y ,v z } = Cone {v x ,v y ,v z } and Roof*{v x ,v y ,v z } 
= Cone*{ Vl }, where v x = (1, 0, 0), v y = (0, 1, 0), and v z = (0, 0, 1) (Fig.3). 
Using indeterminates x±, x 2 , X3, we obtain the following. 

Lemma 3.5 (Monomial representation). For A C L%* , 

Cone A ~ [yiW I (hMs) G ^ 3 and 

3(a 1 ,a 2 , a 3 ) G A s.i. a* < k (i = 1, 2, 3)} , 
ifc>o/ A ~ {j/i'W'V 3 I (h,h,h) G M 3 and 3m G N a.f. 

2/1 W 3 y™ G Gone A (i = 1,2,3)} , 
Cone* A ~ jxi'W 2 ^' 3 | {h,h,h) G M 3 and 

3(ai, a 2 , a 3 ) G A s.i. a; < k (i = 1, 2, 3)} , 
floo/M ~ {xi n x 2 ' 2 X3' 3 I (h,l2,k) G K 3 and 3m G N s.i 

xi^W 2 ^' 3 ^" 1 G ConeM (i = 1,2,3)} . 

We denote the "peaks" of a cone u> by g(w). That is, <?(n;) C L3 (or g(u?) C 
L3*) is the minimal system of elements of w which satisfies w = Cone g(w) (or 
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w = Cone*g(w)). In particular, Roof A = Cone g(Roof A) for A C L3 and 
Roof* A = Cone* g (Roof* A) for A C L 3 *. The boundary surface dw C K 3 of 
a conjugate cone (or roof) w is given by: 

:= < (h,h, h) €w \ max {min{Zi - a±, l 2 — a 2 , h ~ 0-3}} = > . 

[ (ai,a2,a 3 )€g(w) J 

The boundary surface dw C R 3 of a standard cone (or roof) w is given by: 
dw := {(h + h,h + h,h + h) € w \ (h,h,h) G R 3 and 

max {min{li — ai,l 2 — ci2,h — 0,3}} = 0> . 

{a 1 ,a 2 ,a 3 )&g{w) J 

Then, the "peaks and valleys" of cubes are defined as the boundary sur- 
face of a conjugate cone. For example, one defined by a peak v € 1? is 
d(Cone*{v}) and another one defined by three peaks v xy , v yz , and v xz £ 1? is 
d(Cone*{v xy ,v yz ,v xz }). 



4. Triangle tiles 



Next we give the definition of the triangle tiles. For v±, v 2 , and V3 € M 3 , 
we denote the convex hull of them by conv[v±, v 2 , V3], that is, 



conv [v±, v 2 , vs\ := < 



^2 Xm € M. 3 I Ai G R (» = 1, 2, 3) s.t. 



i=l,2,3 



Ai = 1 and A, > (i = 1, 2, 3) 

1=1,2,3 

Definition 4.1 (Slant triangle tiles). For v a = (l,m,ri) C L3* , we de- 
fine six slant triangle tiles (Fig. 1(b)): 



v a [xix 2 \ 

V a [X1X3} 
V a [X2X1] 
V a [X 2 X 3 ] 
V a [x3Xl] 
V a [X3X 2 ] 



= COnV [V a ,Vax,Vaxy\ , 
= COnV [v a ,Vax,Vaxz] , 
= conv [v a ,Vay,V ay x] , 
= COnV [Va,Vay,V a y Z ] , 
= COnV [Va,V az ,V aZ x} , 
= COnV [Va,V az ,V aZ y} , 



where v ax = (I + 1, m, n), v ay = (l,m + 1, n), and v az = (I, m,n + 1), v axy = 
(I + 1, m + 1, n), 77 ayz = (I, m + 1, n + 1), u a:r2/ = (/ + 1, m + 1, n), and v axz = 
(I + 1, m, n + 1) € L3*. VFe denote the collection of all slant triangle tiles by 
S3: 

S3 := {a [x p (i)X p{2) ] I a G L 3 *, p G , 
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where Symg is the collection of all permutations of three letters, that is, the 
symmetric group on three letters. 

Finally, we give the definition of the "view from (— oo, — oo, — oo)" . To 
identify slant triangle tiles in the direction from (+00, +00, +00) to (—00, —00, 
— 00), we consider "shift operator" a on S3 (Fig.l(c)): 

o (a [x pW ■ ■ ■ ^(jv-i)] ) := ax p{ i) [x p{2) • • • x p{N) ] , 

where ax\, 0x2, and 0x3 denote the point of (/ + l,m,n), (l,m + l,n), and 
(l,m,n + 1) G Z 3 respectively for a = (l,m,n). Note that shift operator a 
induces an equivalence relation on S3: 

si ~o- s 2 if and only if 3k G Z s.t. o" fc (si) = s 2 , 

for si, s 2 G S3. 

Definition 4.2 (Flat triangle tiles). A flat triangle tile is an equiv- 
alence class of a slant triangle tile and denoted by s mod a for some s G S3. 
In particular, the collection B3 of all flat triangle tiles is the quotient of S3 by 

B3 := 53/ 

For example, let's consider the cone w = Cone*{v X y,v yz ,v xz } (Fig.2(b)). 
Then, the area of dw enclosed by dwnd(Cone {v\}) consists of six slant triangle 
tiles, v xy [x 3 xi], v xz [x 2 xi], v xz [x2X 3 ], v yz [x\x^, v yz [xix 2 ], and v xy [x3X 2 } G S3 
(Fig. 4(b)). And the corresponding six flat triangle tiles form a regular hexagon, 
that is the "view of the area from (—00, —00, —00)". 

5. Surface decomposition 

Now we give the definition of the "drawing" on the surface of a conjugate 
cone, which divides the surface into sequences of slant triangle tiles. 

Firstly, we define a decomposition of slant tiles contained in the surface 
of a conjugate cone. 

Definition 5.1 (Decomposition by a standard cone). For a conjugate 
cone w\ and a standard cone w 2 , set 

In(wi,w 2 ) := {s G S3 | s C dw\ fl ^2} , 
Out(w\,w 2 ) := {s G S3 I s C dw\ \ w 2 ] , 
Bd(wi, w 2 ) := {s G dw\ \ s £ In(w\,w 2 ) U Out(wi,w 2 )} . 

w 2 is called consistent with w\ if Bd{w\.w 2 ) = 0. 

Then, for a conjugate cone w, the "drawing" on dw is defined as the 
collection of all intersections between dw and the surfaces of the standard 




(a) (b) 

Figure 4: Surface decomposition, (a): w\ = Cone*{v xy , v yz , v xz } (gray) and 
w 2 = Cone {vi} (white), (b): In(w\, w 2 ). 



cones which are consistent with w: 

{dw n d{Cone A) \ A C L 3 and Bd(w, Cone A) = 0} C dw. 

For example, we obtain 

In(wi , w 2 ) = {v xy [x 3 xi] , v xz [x 2 xi] , v xz [x 2 x 3 ] , v yz [xix 3 ] , v yz [xix 2 ] , v xy [x 3 x 2 ] } , 
Bd(wi, w 2 ) = 

for w\ = Cone*{v xy ,v yz ,v xz } and w 2 = Cone {vi} (Fig.4(a)). Then, intersec- 
tion dw\ fl dw 2 gives the boundary of deformed hexagon v xy v x2yz v xz v xyz2 v yz 
v xy2z (Fig.4(b)). 

6. Differential geometry of triangle tiles 
6.1. Differential structure on B 3 

To define "tangent bundle" on B3, we consider the equivalence relation on S3 
induced by ct 3 : 

si ~o-3 s 2 if and only ifBkG'Z s.t. o~ 3k (si) = s 2 , 

for si, s 2 G S3. We denote an ^^-equivalence class by s mod a 3 for some 
seS 3 . 

Definition 6.1 (Tangent bundle on B 3 ). Tangent bundle T[B 3 ] on B 3 
is the quotient of S3 by ~o- 3 : 

T[B3] := S 3 / ~ CT3 , 

7r : T[B 3 ] — ► i? 3 , 7r (s mod a 3 ) := s mod a. 
T[B 3 \ is identified with B3 x {x\x 2l x 2 X3, X1X3} by one-to-one correspondence 

s mod o~ 3 <~ (s mod a,Ds), 
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aX p(l) X p(2) 
aX p(l) 2x p(2) 
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aX p(l) X p(3) 



u 



u 
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s[i] mod o 5 s 



s[i+l]moda 



(a) 



(b) 



Figure 5: Differential structure, (a): Local trajectories specified by a[x p ^x p ^ 2 )] 
G S3, (b): The second derivative along 

where the gradient Ds of s £ S3 is defined by 

Da[x p{1) x p{2 )] ■= Xp^Xp^y 

For each element of T[i?3], we assign a local trajectory. And we shall 
obtain a flow on B3 by patching these local trajectories together. 
We start with the definition of the local trajectories on 53. 

Definition 6.2 (Local trajectories on S3). For a [^(1)^(2)] ^ S3, we 
assign the following four trajectories of length three (Fig. 5(a)): 

{a/x pi 2) [xp(2)Xp(i)] , a [xp(i)X pi 2)] , ax p{1) /x pi3) [x p ( 3) x p(2) ] } 

{a [xp(!)Xp {3) ] , a [x pil) Xp (2) ] , ax p{1) /x p{3) [^(3)^(2)] } 

{a/x p {2) [x p (2)Xp(i)\ , a [xp(i)Xp {2 )\ , ax p{1) [^(2)^(1)]} 

{a [xp(i)X /9 ( 3) ] , a [xp {l) Xp {2 )\ , ax p(1) [x p ^)X p (i)] } 

A flow on S3 is obtained by patching one of these local trajectories together. 

Note that they are mapped onto the same local trajectory on B3 by it. 

Lemma 6.3 (Local trajectories on B 3 ). The local trajectories on S3 at 
a [xp(!)Xp( 2 )] £ S3 induce the following local trajectory on B3 at a [2^(1)^(2)] 
mod a: 

{a [x p (i)X p(3 )] mod a, a [xp^Xp^y] mod a, ax p(1) [xp{2)X p{ x)\ mod a) 

The surface of a conjugate cone defines a flow on S3 and we have the 
following. 



Lemma 6.4. Let {s[i] | 1 < i < n} C S3 be a closed trajectory defined by 
a conjugate cone w. Then, there exist standard roofs w\ and W2 s.t. they are 



DDG OF TRIANGLE TILES 



11 



consistent with w and 

{s[i] | 1 < i < n} = In(w, w\) \ In(w, w 2 ). 

Proof. Suppose that s[i] = a^ii^] (1 < i < n) and set 
w\ = Roof {a-i I 1 < i < n}, 

W2 = Roof {a | 3s € B s.t. s = a[^ p (i)^ p (2)] f° r some p £ Syms}, 

where B = {s G In(w , wi) \ s / s[i] (1 < i < n)}. □ 

Since there are only two choices of succeeding slant tiles for each direction 
at a [^(1)^(2)] G S 3 : 

a/x p (2)[xp(2)X p (i)] or a [x p{1) x p{3) ] 

for one direction and 

axp{i)/x p (z) [x p (z)X p{ 2)\ or ax p{1) [x p{2 )X p{1) ] 

for the other, we obtain the following definition of the "second derivative" 
along a trajectory on S3. 

Definition 6.5 (The second derivative). Let {s[i]} be a trajectory on 
S3. The second derivative D 2 s[i] along {s[i]} is a {U, D} -valued function de- 
fined by 



+ 1]) := 



D 2 (s[i]) ifD(s[i + l]) = D(s\i}), 
-D 2 {s[i]) else, 



where —D := U and —U := D (Fig. 5(b)). In other words, the value of the 
second derivative is negated when the gradient of a trajectory changes. 

Note that we can decode the shape of a trajectory on S3 by the second 
derivative along it, i.e., by an U/D sequence. See below for an example, an 
encoding of the shape of a regular hexagon. 

6.2. Vector field on B 3 induced by a cone 

For a conjugate cone, dw specifies a unique slant triangle tile over each point 
of B3. In other words, dw specifies a section of S3 over B3, which we denote 
by IV 

r w (t) := the unique slant tile s C dw s.t. t = s mod a 
for t € B3. Then, T w induces a vector field over B3. 

Definition 6.6 (The vector field induced by a conjugate cone w). 
X w (s mod a) := DT w (s mod a) G {21X2, X2X3, X1X3}. 
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And any vector field on B 3 is locally denned by a conjugate cone. In 
general, we need more than one cone to specify a vector field because of overlaps 
between slant triangle tiles as in the case of local charts of a manifold of 
conventional differential geometry. 

Let {t[i\} C B 3 be a trajectory defined by a vector field X w . Then the 
second derivative D 2 T W of T w along {t[i}} is given by 



For example, let's consider the conjugate cone w = Cone*{v xy ,v yz ,v xz } 
again (Fig.4). We have seen that the area of dw enclosed by dwnd(Cone {vi}) 
consists of six slant triangle tiles and mapped onto a regular hexagon by ir. 
Actually they form a closed trajectory {t[i] \ 1 < i < 6} of B 3 , where 





= V xy [x 3 X 1 }, 


X w (t[l}) 


= xix 3 , 


D 2 T w (t[l}) 


= D. 


T w (t[2]) 


= Vxz[x2Xl], 


X w (t[2}) 


= X!X 2 , 


D 2 T w (t[2]) 


= u, 


r w (t[3]) 


= v xz [x 2 x 3 ], 


X w (t[3}) 


= x 2 x 3 , 


D 2 T w (t[3}) 


= D. 


r«,(*[4]) 


= Vyz[xiX 3 ], 


X w {t[4]) 


= X!X 3 , 


D 2 T w (t[4]) 


= u, 


r w (t[5]) 


= V yz [xiX 2 ], 


X w (t[5]) 


= X!X 2 , 


D 2 T w (t[5]) 


= D. 


T w (t[6]) 


= V xy [x 3 X 2 ], 


X w (t[6}) 


= x 2 x 3 , 


D 2 T w (t[6}) 


= U. 



As a result, we obtain a binary code of the shape of a regular hexagon: 



Note that, if we had set D 2 T w {t[\}) to U, the binary code would be negated. 

On the other hand, we can compute a trajectory which is encoded by a 
given sequence of U and D. For example, let's consider the binary sequence of 



and construct a section T of S3 over B 3 which induces a trajectory whose 
second derivative is given by the sequence. Suppose r(i[l]) = v xy [x 3 xi] and 
t[2] = v xz [x 2 x±] mod a. Then, we obtain 





= V X y[x 3 Xl], 


DT(t[l}) 


= X!X 3 


r(t[2]) 


= v xz [x 2 Xi], 


DT(t[2}) 


= X\X 2 . 


r(t[3]) 


= v xz [x 2 x 3 ], 


DT(t[3}) 


= x 2 x 3 . 


r(t[4]) 


= Vy z [X!X 3 ], 


DT(t[4}) 


= XlX 3 . 


r(t[5]) 


= Vy z [X!X 2 ], 


DT(t[S\) 


= X\X 2 . 


r(t[6]) 


= v xyz [x 2 x{\, 


DT(t[6]) 


= X\X 2 



The first five tiles i[l],t[2], • • • ,t[5] correspond to the vector field induced by 
Cone* {v xy ,v yz ,v xz } and the last five tiles t[2], t[3], ■ ■ ■ ,t[6] correspond to the 
vector field induced by Cone* {v yz , v xz }. That is, we need two local vector 
fields to cover the trajectory. 




D 2 T w (t[{\) iiX w (t[i + l}) 
-D 2 T w (t\i\) else. 



D-U-D-U-D-U. 



D-U-D-U-D-D 
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7. Algebra of closed trajectories 



7.1. Addition of roofs 

Finally we define addition of roofs to consider an analog of protein complexes. 
Definition 7.1 (Addition of conjugate roofs). For A,Bc L 3 * , 



For example, 

Roof*{v x } + Roof*{v y } + Roof*{v z } = Roof*{ Vl }, 
Roof*{v xy } + Roof*{v yz } + Roof*{v xz } = Roof*{v xy ,v yz ,v xz }. 

And we set \w\ := ir(In(w, Roof g(w)) C B 3 for a roof w of L 3 *, which 
we call the norm of w. For example, 



(Note that Roof*{v xy , v yz , v xz ) = Cone*{v xy ,v yz ,v xz } and see Fig.4(b).) 
In particular, we obtain the following. 

Lemma 7.2. For a finite set {wi} of conjugate roofs, (Jj \wi\ C Ei^il- 
For another example, set 

wi = Roof*{v xy ,v yz ,v xz }, 
w 2 = Roof*{v xy2z -i,v y2 ,v xy }, 

W 3 = ROOf*{Vy2,V X -ly2 Z ,Vy Z }, 

where v xy2z -i = (1,2,-1), v y2 = (0,2,0), and v x - ly2z = (-1,2,1) (Fig.6(a)). 
Note that \wi\ (i = 1,2,3) is a closed trajectory of length six and forms a 
regular hexagon. Then, we obtain 

^1+^2+^3 = Roof*{v xy , V xy - lz2 ,V x _ lyz2 ,V z }, 

\w\\ U \w 2 \ U \wz\ = \w\ + W 2 + Wz\. 

That is, three trajectories \u>i\ (i = 1,2,3) of length six are fused into a tra- 
jectory of length 18 (Fig.6(b)). In other words, \wi +^2 + ^3) is an analog of 
protein complexes made up of three subunits. 

In general, a conjugate roof is not associated with a single trajectory. 
But there may exist a conjugate cone which defines a single closed trajectory 
which sweeps all the closed trajectories of the roof. For example, consider 



Roof* A + RoofB := Roof* A U B. 



\Roof*{ Vl }\ 
\Roof*{v xy ,v yz ,v xz }\ 



\Roof*{v x }\ = --- = \Roof*{v xz }\=% 
{^[^3^1] mod a, v xz [x 2 xi] mod a, 
v xz [x 2 x 3 ] mod a, v yz [xix 3 ] mod a, 
v yz [xix 2 ] mod a, v xy [x 3 x 2 ] mode}. 



14 



NAOTO MORIKAWA 



I 



w, 



w 2 v I. L v yz w 3 

J I u 

V xy2zl V y2 V xly2z 



I 



w 



x^x 

V xy2zl V xly2z 



(a) 



Vxz v z2 
I I 



T 



V XJ'/'S. >^%. ^/S. V x - ly Z 9 

I I 



(b) 



V xz V z2 



*y2 



»x-ly2z 



(c) (d) 

Figure 6: Algebra of closed trajectories, (a): The closed trajectories defined 
by wi, W2, and W3. (b): The closed trajectory defined by W4. (c): The closed 
trajectories defined by W2 + W3 + W5. (d): The closed trajectory defined by 
cone wq. 



w\ +W3 + W5, where W5 = Roof*{v yz , v x -i yz 2, 1^2} (Fig.6(c)). It is associated 
with three closed trajectories and 

W 1 +W 3 + W 5 = Roof*{v xz ,V X y,Vy2,V x - ly 2z,V x -ly Z 2,V z 2} 

Then, wq = Cone*{v xz ,v xy ,Vy2,v x -iy2z,v x -i yZ 2,v Z 2} defines a single closed 
trajectory of \wi + W3 + 105 1 (Fig.6(d)). 

8. Conclusion 

The basic idea of a new mathematical framework that can be applied to 
biological problems such as analysis of the structure of proteins and protein 
complexes has been described. 

If we consider the case of N = A, we would obtain a description of proteins 
as trajectories of tetrahedron tiles, where the structure of a protein is encoded 
into a sequence of U and D ([4]) . And we could extract static structual 
information of proteins directly from genes by comparing U/D sequences with 
the corresponding genes that are sequences of four letters A, T, G, and C. 

Moreover, by assigning roofs of L3* to proteins and their complexes, we 
could describe formations of protein complexes algebraically. 
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